IUPAC-IUB Commission on Biochemical Nomenclature 

Rules for Naming Synthetic Modifications of Natural Peptides 

Tentative Rules 



During the last, few years, chemists have made many com- 
pounds that are variants of naturally occurring peptides (or 
proteins) having trivial names. Therefore, the need has arisen 
for "semitrivial" names to designate these variants without the 
necessity of designating every residue in the chain. 

After discussion with active workers in the field, the following 
proposals are put forward; they are bawl on the names used 
by du Vigneaud and his collaborators (cf. Rodanszky and du 
Vigneaud, /. Am. Chem. Soc, 81, 1258 (1959); Popenoe, Lawler, 
and du Vigneaud, Am. Chem. Soc, 74, 3713 (1952)) and the 
symbols introduced by Schwyzer et al. (cf. Rittel, Iselin, Kap- 
peler, Riniker, and Schwyzer, Angew. Chem., 69, 179 (1957); 
Riniker and Schwyzer, Helv. Chim. Ada, 44, 685 (1961); see 
also J. Biol. Chem., 241, 2491 (1966); Biochim. Biophys. Acta, 
121, 1 (1966)). 

This draft has been prepared by a subcommittee consisting 
of J. S. Fruton, W. Klyne, and R. Schwyzer. The subcommittee 
is greatly indebted to many colleagues for helpful suggestions, 
notably to V. du Vigneaud, J. Rudinger, H. B. F. Dixon, and 
P. E. Verkade, chairman of the IUPAC Commission on Organic 
Nomenclature. 

These proposals are not suitable for application to "abnormal" 
links in a peptide sequence, e.g. to disulfide links or 7-peptide 
links. They are only suitable for modifications involving normal 
a-peptide links. 



1. Replacement 

In a polypeptide of trivial name X, if the qth amino acid 
residue (starting from the NH 2 -terminal end of the chain) is 
replaced by the amino acid residue Abe, the semitrivial name 
I of the modified polypeptide is [cp amino aeidhX and the abbre- 
viated form, chiefly for use in tables, is [AbC]-X. 
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Examples: 

[8-Citrulline]-vasopressin, [Cit 8 ]-vasopressin (Bodanszkv and 

Birkhimer, J. Am. Chem. Soc., 84, 4963 (1962)) 
[5-lsoleucine,7-a]anine]-lHpen.-ns-in II, [He 5 , AIa']-hvpertensm II 

(Seu, Smeby, and Burapus, ./. Am. Chem. Soc, 84, 3883 (1962)) 

Comments — (a) In the full name, the replacement amino 
acid is designated by its own fall name, not the name of its radical 
(cf. Rule 4 below). This name and the position of replacement 
are given in square brackets [ ], as for isotopic replacement. 

(b) In the abbreviated form, the amino acid residues are 
designated by the standard three-letter symbols (J. Biol. Chem., 
241, 527, 2491 (1966); Biochim. Biophys. Acta, 121, 1 (1966)), 
the first letter only being a capital, in square brackets [ ]. 

(c) In the abbreviated form, the position of substitution is 
indicated in a special fashion, i.e. by a superior numeral q , to 
indicate that it is a residue, not an individual atom, that is 
being replaced, and also for the reason indicated in Comment 
d. 

(d) The nature of the residue replaced is not designated in 
either the full or the abbreviated name. This is contrary to a 
general principle of organic nomenclature requiring that an 
atom (or group) that is replaced should (unless it is hydrogen) 
be clearly designated, as in 2 amino 2-deoxy-n-gIucose. It has 
been decided not to insist on the designation of the residue re- 
placed in these semitrivial names in order to keep the names as 
short as possible, and because the form of nomenclature in 
Rule 1 clearly differs from ordinary siibsfifuiiou nomenclature. 

(c) A partial analogy may be drawn with the form used for 
isotopic replacement, in which the isotope symbol is indicated 
in square brackets before the name. 

(f) The replacement oi an amino acid residue by its enantio- 
mer may be shown logically by the application of this rule as fol- 
lows: the replacement in X of L-alanine at position 7 by D-ala- 
nine results in [7-D-alanine)-X with the abbreviation [u-Ala'j-X. 
An example may be found in Bok-onnas, (bittman, and Pless, 
Ezperientia, 22, 526 (1966), dealing with the D-Ser 1 . . . deriva- 
tive of ^-corticotropin; the natural compound has L-serine in 
position 1. Another example is the [a-n-Asp : J- -hypertensin II 
of Riniker and Schwyzer ill eh. Chim. Acta, 47, 2357 (1964)). 

2. Extension 

The compounds obtained by tin < ,*>i,-.>n loh peptide X 
at either (o) the NH 2 -terminal end or (b) the. COO II -terminal 
end are designated by the kinds of names and abbreviations 
shown below. Th< m- 1 , 11 1 n 1 1] pram 

pies of polypeptide nomenclature (J. Biol. Chem., 241, 2491 
(1966): Biochim. Biophys. Ada, 121, 1 (1966)). 
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| f'\am; L - . I . ,n at NH r terminal end: 

Aroinoacyl-X Abc-X 
! e.g. Valyl-X Val-X 

; 01 Yah!- \ Val-Gly-X 

for extension by 2 residues. 
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(a) If the substitution is on a side chain ammo group of 
polypeptide X, the n tint i.t tt t )t t« njno md residue 
is written (with the termination "yl") and prefixed by symbols 
indicating the position of substitution (residue number and 
atom). 

Ivxainpie: An imaginary eompound (A; 



where X-yl is the l: 
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e of polypeptide X with the ending 
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I Comment. —This rule is not applicable to the extensi 
, ^ COOH-terminal of natural peptides having a terminal a car 
l|3lf boxamido group, as in the case of oxytocin or a-melanophore- 
-timulating hormone (a-MSH). It has been suggested that 
in w i ire' In 2 'i >o the peptides having a free terminal 
fflK»F a-carboxyl group (e.g. oxytocinoie acid) and that extension at 
jfSp the COOH-terminal end be denoted as in the example given 
above (e.g. oxytoeinoyl-Abc). 

3. Insertion 

The compound obtained by the insertion of an additional 
amino acid residue Abe in ihe position between the qth and 
(q + l)th residues of a polypeptide X is named qa-endo-amino- 
acid-X (abbreviated form, endo-Abe)-X). 
Example: 



4a-Eiido-tyrosiiie-hypertensiii II ; endo-Tyr <a -hypert 



■ II 



Comments — (a) This form has analogies in other fields in which 
endo implies the insertion of something into a structure (e.g. 
endomethylene). The prefix or index qa is based on analogies 
with the steroids where the atoms inserted in a ring after atom 
number q are designated qa, qb, etc. 

(b) The prefix homo is not suitable for designating the inser- 
tion of a whole residue, since it is commonly used to modify the 
names of individual amino acids, e.g. homoserine. 

(c) Multiple insertions and insertion of two or more residues 
I together in the same place in the chain are shown by a logical 
I extension of this rule. For example, the insertion into the poly- 
I peptide X of threonine between residues 4 and 5 and of valine 
I and glycine (in that order) between residues 6 and 7 is shown 
I by the name, "cndo-4a- threonine, 6a- valine, 6b-glycine-X" and 
I the abbreviation "endo-Thr 1 ", (Val 6: '~ Gly 6b )-X." 

4. Removal 

I The compound i .Warned by ihe formal removal of an amino 
I acid residue in position q from a polypeptide X is designated 
by the nami des-q-amii X, abbi ted Vbc q X. 
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The compound formed by the substitution of an additional 
imino acid residue as a side chain into a polypeptide X is named 



12 3 

in which a valyl group is substituted at the e-amino group of 
lysine at position 2 of the chain of a peptide X is named jV«*- 
valyl-X (abbreviated iV' 2 - Val-X). 

(b) If the substitution is on a Ade chain carboxyl group of 
polypeptide X. t he additional amino acid having a free a-carboxyl 
group, the substituted derivative is named by specifying the 
position of substitution (residue number and atom) and is given 
the designation "X-yl- amino acid." 

Example: An imaginary compound (B) 
Val 

B Ala-Leu-Glu-Ala... 
12 3 4 

in which a valine residue is substituted into the 5-carboxyl 
group of glutamic acid in position 3 of the chain of a peptide X 
would be named C ss -X-3d-valine (abbreviated C S3 -X-yl-Val). 

Comment — Note the importance of clear distinction from 
replacement as indicated in Rule 1 . 

6. Partial Sequences (Fragments) 
Polypeptide sequences that form fragments of a longer se- 
quence that already lias a trivial name may be designated a=s 
follows. The trivial name is followed by numbers giving the 
positions of the first and last amino acids, and then the usual 
Greek designation giving the number of amino acid units in the 
fragment, thus: 

Trivial name (-X-Y-) . . . peptide 
Example: from a-MSH 
Ac-Ser-Tyr-Scr-Met-Glu-His-Phe-Arg-Trp- 



-Gly-Lys-Pro -Val-NH, 
10 11 12 13 



We may have 

Met-Glu-His-Phe-Arg-Trp-Gly 



«-MSH-(4-10)-heptapeptidc 



The last example illustrates the nomenclature for a composite 
sequence of two fragments, and also for an amide-terminal group. 
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Fundamental name 




1 2 3 4 5 
Ala-Lys-Glu-Tyr-Leu 




Replacement 


[Phe'l-iiipaoiubin* 


4 

A 1 a-Lys-Glu-Phe-Leu 




K \ 1 ens i < us. XII ..-terminal 


Arginyi -iupaci ubi n 
Are . upaeiubin 


1 5 
Arg-Ala-Lys-Glu-Tyr-Leu 

1 5 




Extension, COOH-terminal 
Insertion 


1 i;.".i-n.:>> I Mel h 1 1 ■ i ; : : ; i ■ 

lu|Kiciubyl-Met 
Endo-Thr 2 *-iupaciubi n 


Ala-Lys-Glu-Tyr-Leu-Met 

2 2a 3 5 
Ala Lys-Thr-Glu-Tyr-Leu 




Removal 


Des-GIu 3 -iupaciubin 


2 4 
Ala-Lys-Tyr-Leu 

Val 




Side (-ham .substitution on amino group 


JV" ,s -Val-iupaciubin 


l< 

Ala-Lys-Glu-Tyr-Leu 
2 

Val 




Side chain substitution on earboxyl 
group 


C- i3 - Iupaciubyl-valinc 


1 « 

Ala-Lys-Glu-Tyr-Leu 
3 




Partial sequence 


Iupaciubin-(2-4)-tripeptide 


2 3 4 

Lya-GIu-Tyr 



"To symbolize the harmonious cooperation of IDPAC and IUB. 
'' Note that only fur rcplarement are square brackets required. 



